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Abstract 

Many computer vision algorithms employ subspace 
models to represent data. The Low-rank representation 
(LRR) has been successfully applied in sub space clustering 
for which data are clustered according to their subspace 
structures. The possibility of extending LRR on Grass¬ 
mann manifold is explored in this paper. Rather than di¬ 
rectly embedding Grassmann manifold into a symmetric 
matrix space, an extrinsic view is taken by building the self¬ 
representation of LRR over the tangent space of each Grass- 
mannian point. A new algorithm for solving the proposed 
Grassmannian LRR model is designed and implemented. 
Several clustering experiments are conducted on handwrit¬ 
ten digits dataset, dynamic texture video clips and YouTube 
celebrity face video data. The experimental results show 
our method outperforms a number of existing methods. 


1. Introduction 

In modern computer vision research, one always models 
a collection of data samples as a subspace. For example, 
the set of handwritten digital images of a single digit from a 
particular person can be modeled as a subspace, which can 
be defined by several top principal components revealed by 
PC A (principal component analysis) [38]. Alternatively, a 
video clip can be viewed as a collection of data samples 
(frames), thus it is natural to regard the video clip for a par¬ 
ticular scene/action as a whole to be modeled as the sub¬ 
space that spans the observed frames [17, 36]. The data 
samples produced from video data or image sets in this way 
are in fact the subspaces, rather than the data points in Eu¬ 
clidean spaces. 


Our main motivation here is to develop new methods for 
analysing video data and image sets through subspace rep¬ 
resentations, by borrowing the ideas used in analysing data 
samples in linear spaces. It should be mentioned that the 
clustering problem to be researched in this paper is different 
from the problem in the conventional subspace clustering 
where the fundamental purpose is to cluster data samples in 
the whole Euclidean spaces into clusters according to their 
subspace structures. That is, the objects to be clustered are 
samples in the main linear space while in this paper the ob¬ 
jects themselves are subspaces (of the same dimension), i.e., 
the points on the abstract Grassmann manifold [2, 9]. The 
relevant research can be seen in e.g. [5]. 

The subspace clustering has attracted great interest in 
computer vision, pattern recognition and signal processing 
[10, 40, 42, 43]. Vidal [37] classified subspace clustering 
algorithms into four approaches: algebraic [20, 21, 29], sta¬ 
tistical [13, 34], iterative [19, 35] and spectral clustering 
based [10, 23, 27]. The procedure of spectral clustering 
based methods consists of two steps: (1) learning a simi¬ 
larity matrix for the given data sample set; and (2) perform¬ 
ing spectral clustering to categorize data samples such as 
K-means or Normalized Cuts (NCut) [32]. 

Two representatives of spectral clustering based methods 
are Sparse Subspace Clustering (SSC) [10] and Low Rank 
Representation (LRR) [27]. Both SSC and LRR rely on 
the self expressive property in linear space [10]: each data 
point in a union of subspace can be efficiently reconstructed 
by a combination of other points in the data. SSC fur¬ 
ther induces sparsity by utilizing the h Subspace Detection 
Property [8] in an independent manner, while LRR model 
considers the intrinsic relation among the data objects in 
a holistic way via the low rank requirement. It has been 
proved that, when the high-dimensional data set is actually 
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composed of a union of several low dimension subspaces, 
LRR model can reveal this structure through subspace clus¬ 
tering [28]. 

However the principle of self expression is only valid and 
applicable in classic linear spaces. As mentioned above, the 
data samples on Grassmann manifold are abstract. It is well 
known [ 7 ] that Grassmann manifold is isometrically equiv¬ 
alent to the subspace of symmetric idempotent matrices. 
To get around the difficulty of SSC/LRR self expression in 
abstract Grassmann setting, the authors of [38] embed the 
Grassmann manifold into the symmetric matrix manifold 
where the self expression can be naturally defined, thus an 
LRR model on Grassmann manifold was formulated. 

In this paper, we review the geometric properties of 
Grassmann manifold and take an extrinsic view to develop 
a new LRR or SSC model by exploring tangent space struc¬ 
ture of Grassmann manifold. The new formulation relies on 
the so-called Log Mapping on Grassmann manifold through 
which the self expression of Grassmann points can be im¬ 
plemented in the tangent space at each particular Grass¬ 
mann point. By this way, we extend the classic LRR model 
onto Grassmann manifold, namely GLRR, so LRR can be 
used for representing the non-linear high dimensional data 
and implement clustering on the manifold space. 

The primary contribution of this paper is to 

1. formulate the LRR model on Grassmann Manifold by 
exploring the linear relation among all the mapped 
points in the tangent space at each Grassmann point. 
The mapping is implemented by the Log mapping on 
the Grassmann manifold; and 

11. propose a practical algorithm for the solution to the 
problem of the extended Grassmann LRR model. 

The rest of the paper is organized as follows. In Section 
2, we summarize the related works and the preliminaries 
for Grassmann manifold. Section 3 describes the low rank 
representation on Grassmann Manifold. In Section 4, an al¬ 
gorithm for solving the LRR model on Grassmann manifold 
is proposed. In Section 5, the performance of the proposed 
method is evaluated on clustering problems of three public 
databases. Finally, conclusions and suggestions for future 
work are provided. 

2. Preliminaries and Related Works 

In this section, first, we briefly review the existing sparse 
subspace clustering methods, SSC and LRR. Then we de¬ 
scribe the fundamentals of Grassmann manifold which are 
relevant to our proposed model and algorithm. 

2.1. SSC [10] and LRR [27] 

Given a set of data drawn from an unknown union of 
subspaces X = [xi, X 2 ,x^v] G where D is the 


data dimension, the objective of subspace clustering is to 
assign each data sample to its underlying subspace. The 
basic assumption is that the data in X are drawn from union 
of K subspaces of dimensions 

Under the data self expressive principle, it is assumed 
that each point in data can be written as a linear combination 
of other points i.e., X = XZ, where Z G is a ma¬ 

trix of similarity coefficients. In the case of corrupted data, 
we may relax the self expressive model to X = XZ + E, 
where E is a fitting error. With this model, the main pur¬ 
pose is to learn the expressive coefficients Z, under some 
appropriate criteria, which can be used in a spectral cluster¬ 
ing algorithm. 

The optimization criterion for learning Z is 

min ||E||2 + A||Z||„ s.t. X = XZ + E, (1) 

Z,E 

where p and q are norm place-holders and A > 0 is a penalty 
parameter to balance the coefficient term and the recon¬ 
struction error. 

The norm || • \\p is used to measure the data expressive 
errors while the norm || • \\q is chosen as a sparsity inducing 
measure. The popular choices for || • ||p is Frobenius norm 
II • IIF (or II • II 2 ). SSC and LRR differentiate at the choice 
of II • llg. SSC is in favour of the sparsest representation 
for the data sample using ii norm [37] while LRR takes a 
holistic view in favour of a coefficient matrix in the lowest 
rank, measured by the nuclear norm || • ||*. 

To avoid the case in which a data sample is represented 
by itself, an extra constraint diag(Z) = 0 is included in 
SSC model. LRR uses the so-called ^ 1/^2 norm to deal 
with random gross errors in data. In summary, we re-write 
SSC model as follows, 

min||E||^ + A||Z||i, s.t. X = XZ + E,diag(Z) = 0, 

Z,E 

( 2 ) 

and LRR model 

min ||E||2 ,2 + A||Z||*, s.t. X = XZ + E. (3) 

Z,E ' 

2.2. Grassmann Manifold 

This paper is concerned with points particularly on a 
known manifold. In most cases in computer vision appli¬ 
cations, manifolds can be considered as low dimensional 
smooth “surfaces” embedded in a higher dimensional Eu¬ 
clidean space. A manifold is locally similar to Euclidean 
space around each point of the manifold. 

In recent years, Grassmann manifold [3] has attracted 
great interest in research community, such as subspace 
tracking [33], clustering [5], discriminant analysis [14], and 
sparse coding [15, 18]. Mathematically Grassmann mani¬ 
fold G{p,d) is defined as the set of p-dimensional subspaces 


in Its Riemannian geometry has been recently well in¬ 
vestigated in literature [1, 2, 9]. 

As a subspace in Q (p, d) is an abstract concept, a con¬ 
crete representation must be chosen for numerical learning 
purposes. There are several classic concrete representations 
for the abstract Grassmann manifold in literature. For our 
purpose, we briefly list some of them. Denote by the 
space of all d X p matrix of full column rank; GL{p) the 
general group of nonsingular matrices of order p; 0(p) the 
group of all the p X p orthogonal matrices. 

/. Representation by Full Columns Rank Matrices [ 1 ]: 

a(p,d) = Mf7Gi:(p) 

11. The Orthogonal Representation [9]: 


Both LLE and LTSA exploit the “manifold structures” 
implied by data. In this setting, we have no knowledge 
about the manifold where the data reside. However in many 
computer vision tasks, we know the manifold where the 
data come from. The idea of using manifold information to 
assist in learning task can be seen in earlier research work 
[26] and the recent work [16]. 

To construct the LRR model on manifolds, we first recall 
LLE algorithm. LLE relies on learning the local linear com¬ 
bination Wij^j (here we use all the other data as 

the neighbours of for convenience) under the condition 
= 1 (assume wu = 0). Hence 

'^WijXj -Xi = '^Wij{Xj - Xi) ^ 0. (5) 

j 3 


g{p,d)^o{d)/o{p)xO{d-p) 

III. Symmetric Idempotent Matrix Representation [7]: 

g{p,d) ^ {P G = P,P^ = P,rank(P) = p}. 

Many researchers adopt this representation for learning 
tasks on Grassmann manifold, for example, Grassmann 
Discriminant Analysis [14], Grassmann Kernel Methods 
[17], Grassmann Low Rank Representation [38], etc. 

IV. The Stiefel Manifold Representation [9]: Denote by 
sr{p, d) = {X e : X^X = 4} the set of all the 
p dimension bases, called Stiefel manifold. We identify a 
point S on Grassmann manifold g{p,d) as an equivalent 
class under orthogonal transform of Stiefel manifold. Gen¬ 
erally we have 


g{p,d)^ST{p,d)/o{p). 

Two bases Xi and X 2 are equivalent if there exist an 
orthogonal matrix Q G 0(p) such that Xi = X 2 Q. 

In this paper, we will work on the Stiefel Manifold Rep¬ 
resentation for Grassmann manifold. Each point on the 
Grassmann manifold is an equivalent class defined by 

[X] = {XQIX^X = Jp, Q G 0{p)} (4) 


We note that in Euclidean case Xj — Xi can be re¬ 
garded as an approximated tangent vector at point X^. Eqn 
(5) means that the combination of all these tangent vectors 
should be close to 0 vector. It is well known that on general 
manifold the Euclidean tangent Xj — Xi at X^ can be re¬ 
placed by the result of the Log mapping Logx. (Xj) which 
maps Xj on a manifold to a tangent vector at X^. Thus the 
data self expressive principle used in (1) can be realized on 
the tangent space of each data as 

y^WyLogxjXj) « 0. 

j 

This idea of moving linear relation over to tangent space 
has been used in [6, 12]. The above request was also ob¬ 
tained in [41] by the approximately defined linear combina¬ 
tion on manifold for the purpose of dictionary learning over 
Riemannian manifolds. 

Einally we formulate the SSC/LRR on manifold as the 
following optimization problem: 


N N 


E {Xj 


•A||IL||, 


i=i j=i 
N 

s.t.^Wij = l,i = 
i=i 


( 6 ) 


where X, a representative of the point [X], is a point on 
the Stiefel manifold, i.e., a matrix of with orthogonal 
columns. 

3. LRR on Grassmann Manifold 

3.1. The Idea and the Model 

The famous Science paper by Roweis and Saul [31] 
proposes a manifold learning algorithm for low dimension 
space embedding by approximating manifold structure of 
data. The method is named locally linear embedding (LLE). 
Similarly a latent variable version. Local Tangent Space 
Alignment (LTSA), can be seen in [44]. 


where Log.(') is defined as the Log mapping on the man¬ 
ifold and '^ij — 1 affine constraints to preserve 

the coordinate independence on manifold same as the term 
of the nonlinear sparse representation in [41]. When || • ||g 
is II • 111, we have SSC model on manifolds, and when || • ||g 
takes II • II*, LRR on manifolds is formed. In the sequel, we 
mainly focus on LRR on Grassmann manifold. 

3.2. Properties of Grassmann Manifold Related to 
LRR Learning 

Eor the special case of Grassmann manifold, formulation 
(6) is actually abstract. We need to specify the meaning of 
the norm || • ||x and Logx(Y). 






3.2.1 Tangent Space and Its Metric 

Consider a point S = span(X) on Grassmann manifold 
d) where X is a representative of the equivalent class 
defined in (4). We denote the tangent space at S by 
TsG{p, d). A tangent vector 1-L G TsG{p, d) is represented 
by a matrix H G verifying, see [3], 

^span(X + iH)|t=o = ^- 

Under the condition that X^H = 0, it can be proved that 
H is the unique matrix as the representation of tangent vec¬ 
tor H, known as its horizontal lift at the representative X. 
Hence the abstract tangent space TsG{p-, d) at 5 = span(X) 
can be represented by the following concrete set 

TxG{p,d) = {H G = 0}. 

The above representative tangent space is embedded in 
matrix space and it inherits the canonical inner 

VHi,H 2 e Txg{p,d), {Hi,H 2 )x = trace(HfH 2 ). 

Under this metric, the Grassmann manifold is Riemannian 
and the norm term used in (6) becomes 

||H||^ = (H, H)x = trace(H^H) (7) 

which is irrelevant to the point X. Note that H is a tangent 
vector at X on Grassmann manifold. 


Hence 

Usin(S)Q^(Vcos(S)Q^)“i = (Y-XX^Y)(X^Y)“i 
which gives 

Utan(S)V^ = (Y - XX^Y)(X^Y)“i 

because V is actually an orthogonal matrix. Hence the 
algorithm will be conducting a SVD decomposition of 
= (Y - XX^Y)(X^Y)-\ then define 

Logx(Y) = H = U arctan(S) V^. (8) 

4. Solution to GLLR 

With the concrete specifications (7) and (8) for Grass¬ 
mann manifold, the problem for LRR on Grassmann mani¬ 
fold (6) becomes numerically computable. Denote by 

Bik = trace(Logx. (Xj)^Logx. (Xfe)), (9) 

which can be calculated according to the Log algorithm as 
defined in (8) for all the given data X^’s. Then problem (6) 
can be rewritten as 

N 

imny^WiSiwf + A||TY||* 

r' (10) 

= l,i = 1 , 2 , 

f = l 


3.2.2 Log Mapping of Grassmann Manifold 

Log map on general Riemannian manifold is the inverse of 
the Exp map on the manifold [25]. For Grassmann mani¬ 
fold, there is no explicit expression for the Log mapping. 
However the Log operation can be written out by the fol¬ 
lowing algorithm [9, 30]: 

Given two representative Stiefel matrices X, Y of equiv¬ 
alent classes [X], [Y] G G{p,d) as points on the Grass¬ 
mann manifold (matrix representations in d-by-p matri¬ 
ces), we seek to find H at X such that exponential map 
expx(H) = Y. 

Instead of H, we equivalently identify its thin-SVD H = 
USV^. Let us consider these equations, where Y is ob¬ 
tained with the exponential map expx(USV^), 


where Bi = {Bjj^) and Wi = ..., is the row 

vector of W. The augmented Lagrangian objective function 
of (10) is given by 


TV r 1 TV 

i=i [ i=i 


N 




i=i 


( 11 ) 


where pi are Lagrangian multipliers and we have used an 
adaptive constant f3k . 

Denote the first summation term in (11) by F(IU). At 
the current location we take a linearization of F, 


Y = XV cos(S)Q^ - 1 - U sin(S)Q^, 

where Q is any d-by-d orthogonal matrix. 

Consequently, we need to solve the equations of 
U, S, V, and Q, with the knowledge that X^H = 0, since 
H is in the tangent space, 

Vcos(S)Q^ = X^Y, 

Usin(S)Q'^ = Y - XX^Y. 


F{W) + {dF{W^'^'>), W - 

where dF(W^^^) is a gradient matrix with i-th row given 
by 

w^Bi + yfh + - 1)1 ( 12 ) 

i=i 



with 1 G the row vector of all ones. Let riB = 
max{||Bi|p} + + 1. 

Taking this into (11), we define the following iteration 


= argrmn A||1T^||* 


riBpk 
^ 2 


W- - \-dF{W‘^'‘'>)) 

\ riBpk J 


(13) 

2 

F 


Problem (13) admits a closed form solution by using SVD 
thresholding operator [4]. 

The update rules for yi is 


N 

(14) 

i=i 

and the updating rule for P 


Pk-\-l — P/^/c}- (1^) 


0 / 

7 ^ ? 


Figure 1. The MNIST digit samples for experiments. 



Figure 2. DynTex++ samples, each two frames from same video 
sequence. 


where 

Po Pk\\W^^^ -W^\\<ei 

1 otherwise 

We summarize the above as Algorithm 1. 


Algorithm 1 Low-Rank Representation on Grassmann 
Manifold._ 

Input: The Grassmann sample set {Xi}f^^,Xi G G{p,d) 
and the balancing parameter A. 

Output: The Low-Rank Representation Z 
1: Initialize: Set the parameters po = 1.9, = 

max{\\Bi\p} + + 1, ^max = 10^ > ^0 = 0.1, 

Si = le — 4, S 2 = le — 4, = 0, yo = 0. 

2: Prepare all according to (9) 

3 : while not converged do 

4: Calculate dF{W ^^^) row by row according to (12) 

5 : Let Mk = then 

6: Update the according to (13) 

7 : Update by (14) 

8: Update Pk+i by (15) 

9: Check the convergence condition: Pk\\W^~^^ — 

W^W < Si and - 1|| < ^2 

10: end while 


5. Experiments 

In this section, we evaluate the performance of the pro¬ 
posed GLRR on three widely used public databases: (i) 
MNIST handwritten digits image set [24]; (ii) DynTex-F-F 
dynamic texture videos [11]; and (iii) YouTube Celebrity 
faces videos [22, 39]. 


5.1. Data Preparation and Experiment Settings 

MNIST handwritten digits database [24] consists of ap¬ 
proximately 70,000 digit images written by 250 volunteers. 
All the digit images have been size-normalized and centered 
in a fixed size of 28 x 28. This data set can be regarded 
as synthetic as the data are clean. Some samples of this 
database are shown in Figure 1. We use this data set to 
test the clustering performance of the proposed method as 
noise-free cases. 

To test the clustering performance of the proposed 
method for multiple classes, we select DynTex-F-F database 
[11]. The dataset is derived from a total of 345 video se¬ 
quences in different scenarios which contains river water, 
fish swimming, smoke, cloud and so on. These videos are 
labeled as 36 classes and each class has 100 subsequences 
(totally 3600 subsequences) with a fixed size of 50 x 50 x 50 
(50 gray frames). This is a challenging database for clus¬ 
tering because most of texture from different class is fairly 
similar. Figure 2 shows some texture samples. 

Finally YouTube Celebrity (YTC) dataset [22, 39] is cho¬ 
sen to test the performance of the proposed method on data 
with complex variation. YTC, downloaded from Youtube, 
contains videos of celebrities’ joining activities under real- 
life scenarios in various environments, such as news inter¬ 
views, concerts, films and so on. The dataset is comprised 
of 1,910 video clips of 47 subjects and each clip has more 
than 100 frames. It is a quite challenging dataset since the 
faces are of low resolution and contains different head pose 
rotations and rich expression variations. Some samples of 
YTC dataset are shown in Figure 3. 

Our strategy is to represent the image set or videos 
as subspaces objects, i.e., points on Grassmann manifold. 

















Figure 3. YouTube Celebrity samples. Each row includes frames 
from different video sequence of the same person. 

Then a representative basis of such a subspace can be sim¬ 
ply obtained by using SVD from the matrix of the raw fea¬ 
tures of image sets or videos as done in [15, 18]. Concretely, 
given a subset of images, e.g., the same digits written by the 
same person, denoted by {Yi}f£^ and each Yi is a grey-scale 
image with dimension m x n, we can construct a matrix 
r = [vec(yi), vec(l 2)5 •••, vec(yM)] of size {m x n) x M 
by vectorizing each image Yi. Then F is decomposed by 
SVD as r = UYV. We pick up the left p singular-vectors 
of U as the representative of a point on Grassmann manifold 
Q{p,m X n). 

In all the experiments, we use both K-means and NCut 
for the final clustering step over the learned similarity ma¬ 
trix. For the sake of clarity, we summarize all the compared 
methods in Table 1 . Under LRR model, the performance of 
K-means is bad, so we abandon the 4th and last experiment. 
A is an important parameter for balancing the error term and 
the low-rank term of the LRR model on Grassmann Man¬ 
ifold in (10). Empirically, we can make A small when the 
noise of data is lower and use a larger A value if the noise 
level is higher. 

All experiments are implemented by Matlab R201 lb and 
performed on an Xeon-X5675 3.06GHz CPU machine with 
12G RAM. 

5.2. MNIST Handwritten digits Clustering 

To build up subspaces, we create subgroups randomly 
according to their classes so that the each subgroup consists 
of 20 images, i.e. M = 20. We randomly select 40 sub¬ 
groups for each digit, so our task is to cluster N = 400 im¬ 
age subgroups into ten categories. Then construct a Grass¬ 
mann point for each subgroup with p = 10 as mentioned 
above. Finally LRR model on Grassmann Manifold is con¬ 
ducted on these points and the result of Z is pipelined to 
NCut algorithm. As a benchmark comparison, we mimic a 
point in Euclidean for each subgroup of 20 images by stack¬ 
ing them into a vector of dimension 28 x 28 x 20. For PC A 
based methods, the size 28 x 28 x 20 = 15680 of each 
vector is reduced to 323 by PCA. 

The experimental results are reported in Table 2. It is 
shown that our proposed algorithm has the highest accu¬ 
racy of 98%, outperforming other methods more than 10 
percents. Both GNCut and GK-means methods failed in 


this experiment. Our explanation is that, after the Grass¬ 
mann Manifold mapping, it is inappropriate to simply use 
Euclidean distance as a metric for points on the manifold 
space. As our LRR model incorporates the geometry over 
Grassmann manifold, the clustering results are much bet¬ 
ter than that of PNCut or PLRRNCut, in which Euclidean 
distance is used to measure the relation of reduced data un¬ 
der PCA. The manifold mapping extracts more useful in¬ 
formation about the differences among sample data. Thus 
the combination of Grassmann geometry and LRR model 
brings better accuracy for NCut clustering. In our exper¬ 
iments, we set A = 0.3 for the our LRR model and set 
A = 0.5 for PLRR method. 

5.3. Dynamic Texture Clustering 

As video sequences contain useful space and time con¬ 
text information for clustering, instead of using SVD 
method, we use Local Binary Patterns from Three Or¬ 
thogonal Plans (LBP-TOP) model [45] to construct points 
on Grassmann Manifold. The LBP-TOP method extracts 
LBP features from three orthogonal planes and concate¬ 
nate a number of neighbour points’ features to form a co¬ 
occurrence feature. After extracting LBP-TOP features 
for the 3600 subsequences, we get 3600 matrices of size 
177 X 14 as the points on Grassmann Manifold. As the 
data volume of all the 3600 subsequences is huge, we ran¬ 
domly pick K{= 3,...,10) classes from 36 classes and 50 
subsequences for each class to cluster. The experiments are 
repeated several times for each K. In the experiments, A is 
also set to 0.8. For the PCA based method, the prototype 
50 X 50 X 50 subsequence is reduced to 221 and A for the 
PLRRNCut is set to 0.6. 

The clustering results for DynTex-F-F database are shown 
in Figure 4. For different number of classes, the accuracy of 
the proposed method is superior to the other methods more 
than 10 percents, due to incorporating manifold informa¬ 
tion over the LBP-TOP features. We also observe that all 
of accuracies decreases as the number of classes increases. 
This may be caused by the clustering challenge when more 
similar texture images are added into the data set. 

5.4. YouTube Celebrity Clustering 

In order to create a face dataset form the YTC videos, a 
face detection algorithm is exploited to extract face regions 
and resize each face to a 20 x 20 image. We treat the faces 
extracted from each video as an image set without thinking 
about the number M of frames in different videos. LBP- 
TOP is not appropriate for the extracted face images in this 
case as we have lost relative spatial information, so we use 
SVD method to construct points on Grassmann Manifold 
like the handwritten database. We only test the algorithm 
on AT = 4 classes with = 218 video clips as an example 
for this complicated data case. For the PCA based method. 




Data processing 
method 

Data representation 
method 

Clustering method 

Combining method 

PGA 

- 

NCut 

PNCut 

PGA 

- 

K-means 

PK-means 

PGA 

LRR 

NCut 

PLRRNCut 

PGA 

LRR 

K-means 

- 

Grassmann Manifold 

- 

NCut 

GNCut 

Grassmann Manifold 

- 

K-means 

GKmeans 

Grassmann Manifold 

LRR 

NCut 

GLRRNCut(our method) 

Grassmann Manifold 

LRR 

K-means 

- 


Table 1. Different combining clustering methods with variety in data processing, data representation and clustering methods. 



PK-means 

PNCut 

PLRRNCut 

GK-means 

GNCut 

Our method 

MNIST database 
YTC database 

0.71 

0.6378 

0.8675 

0.5189 

0.8552 

0.2020 

0.47 

0.3568 

0.385 

0.3568 

0.9833 

0.6919 


Table 2. Subspace clustering results on MNIST and YTC databases. 



Figure 4. Clustering results on DynTex++ database with the num¬ 
ber of classes ranging from 3 to 10. 

the size of each subsequence is reduced to 3000 and A for 
the PLRRNCut is set to 0.6. 

The clustering results for YTC face dataset is shown in 
Table 2. The accuracy of our method is significantly higher 
than other methods although it is relatively lower than the 
accuracy for DynTex dataset. One of key reasons for this 
lower accuracy may be due to rapid changes existed in face 
images from a clip. 

6. Conclusion 

In this paper, building on Grassmann manifold geometry 
and the Log mapping, the classic LRR model is extended 
for object subspaces in Grassmann manifold. The data self 
expression used in LRR is implemented over the tangent 
spaces at points on Grassmann manifold, thus a novel LRR 
on Grassmann manifold, namely GLRR, is presented. An 


efficient algorithm is also proposed for the GLRR model. 
Our experiments on image sets and video clip databases 
confirm that GLLR is well suitable for representing non¬ 
linear high dimensional data and revealing intrinsic multiple 
subspaces structures in clustering applications. The exper¬ 
iments also demonstrate that the proposed method behaves 
effectively and steadily in variety of complicated practical 
scenarios. As part of future work, we will explore the LRR 
model on Grassmann manifold in an intrinsic view. 
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